Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning
نویسندگان
چکیده
Meta-reinforcement learning enables artificial agents to learn from related training tasks and adapt new efficiently with minimal interaction data. However, most existing research is still limited narrow task distributions that are parametric stationary, does not consider out-of-distribution during the evaluation, thus, restricting its application. In this paper, we propose MoSS, a context-based algorithm based on Self-Supervised representation address challenge. We extend meta-RL broad non-parametric which have never been explored before, also achieve state-of-the-art results in non-stationary tasks. Specifically, MoSS consists of inference module policy module. utilize Gaussian mixture model for imitate variations. Additionally, our online adaptation strategy agent react at first sight change, thus being applicable exhibits strong generalization robustness out-of-distributions benefits reliable robust representation. The built top an off-policy RL entire network trained completely ensure high sample efficiency. On MuJoCo Meta-World benchmarks, outperforms prior works terms asymptotic performance, efficiency (3-50x faster), efficiency, diverse distributions.
منابع مشابه
A Comparison of Supervised and Reinforcement Learning Methods on a Reinforcement Learning Task
The \forward modeling" approach of Jor-dan and Rumelhart has been shown to be applicable when supervised learning methods are to be used for solving reinforcement learning tasks. Because such tasks are natural candidates for the application of reinforcement learning methods, there is a need to evaluate the relative merits of these two learning methods on reinforcement learning tasks. We present...
متن کاملRobot Task Learning based on Reinforcement Learning in Virtual Space
− As a novel learning method, reinforced learning by which a robot acquires control rules through trial and error has gotten a lot of attention. However, it is quite difficult for robots to acquire control rules by reinforcement learning in real space because many learning trials are needed to achieve the control rules; the robot itself may lose control, or there may be safety problems with the...
متن کاملOperation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملRepresentation Discovery for Kernel-Based Reinforcement Learning
Recent years have seen increased interest in non-parametric reinforcement learning. There are now practical kernel-based algorithms for approximating value functions; however, kernel regression requires that the underlying function being approximated be smooth on its domain. Few problems of interest satisfy this requirement in their natural representation. In this paper we define value-consiste...
متن کاملTask-Oriented Reinforcement Learning
Acknowledgement This thesis is the result of two years of work whereby I have been accompanied and supported by many people. I am extremely indebted to Dr.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i8.26210